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Abstract 

We present the shock-free wave propagation requirements for massless fields. First, we 
briefly argue how the "completely exceptional" approach, originally developed to study the 
characteristics of hyperbolic systems in 1 + 1 dimensions, can be generalized to higher dimensions 
and used to describe propagation without emerging shocks, with characteristic flow remaining 
parallel along the waves. We then study the resulting requirements for scalar, vector, vector- 
scalar and gravity models and characterize physically acceptable actions in each case. 
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1 Introduction 

In this work, a brief version of which appeared in Q, we study the propagation of excitations of 
classical niassless field actions. In general, criteria for physical propagation of such waves can be 
derived in many ways. Here, we will only consider the "completely exceptional" (CE) approach |^, 
originally developed for systems in D = 1 + 1. Roughly speaking, complete exceptionality is the 
property ensuring that the initial "wavefronts" evolve so as to prevent the emergence of shocks, 
which, in general, result when the "characteristics" propagate at different speeds. As we are not 
aware of a rigorous procedure extending ideas developed at D = 2 to higher D, we will follow 
steps similar to those in D = 2, and then outline how to generalize them to higher dimensions. In 
the process, we show how the CE idea can be looked at in seemingly different ways and outline a 
derivation that fills the gap between the two viewpoints. We apply our criteria to massless spin 
0,1,2 nonlinear systems. 

We start, in Section |2|, by introducing the type of physical problems that we will study and 
develop the formalism that will be used throughout. Section |3| gives the analysis of characteristic 
surfaces, which are crucial to the CE idea. In Section ^, we give a simple example in D = 2, and 
demonstrate how shocks may be prevented for this particular problem. Motivated by this example, 
we next show how the introduced ideas can be extended to higher dimensions in Section |5[ This 
naturally leads to the CE concept and we show how one can view it in two seemingly different ways, 
which are explained in the text. In Section B, we study in detail the scalar field in I? = 4 using 
these two separate methods, derive the CE condition on it and argue as to how one can generalize 
the result to arbitrary D. Next, we turn to models of nonlinear electrodynamics in Section ^. Here 
we encounter particular models, the constraints on which not only automatically guarantee the CE 
property (as originally discussed in 0]), but also ensure that both polarizations of light propagate 



according to the same dispersion law, i.e. "no birefringence" [Q, ||. Hence we call these constraints 
the "strong CE" conditions. We also derive (for the first time to our knowledge) the regular CE 
requirement conditions (much weaker than strong CE) in the most general D = A case. Finally, in 
Section |^, we find that wide classes of gravity models share with Einstein the null nature of their 
characteristic surfaces. In three Appendices, we show the details of some calculations skipped in 
the text. 

2 The Formalism 

In this paper we will be dealing with systems of PDEs that are Euler-Lagrange equations of rel- 
ativistic actions. They will be linear in highest derivatives (quasilinear) and their coefficients will 
not depend on the coordinates explicitly. So, they can be reduced to a set of differential equations 
of first derivative order. Hence for U an A^- vector of fields, ^ an A^ x A^ matrix and B an A^- vector 
(both arbitrary smooth functions of C/), the equations of interest can always be written in the form 

M'{U){d^U) + B{U) = ^. (2.1) 

The theory of such equations in arbitrary dimensions is quite difficult, but we will be mainly 
interested in the evolution of the spatial boundary of a wave propagating into some given vacuum. 
So, with U some smooth (say at least C^) solution, at some initial time we have some spatial region 
outside of which the "state" is the "vacuum solution" U , and across the boundary surface the full 
solution U is continuous but its first derivative may not be. We want to consider the evolution of 
such initial "wavefronts" . 

We will follow the formalism developed for this situation in 0, H] and the references therein. 
Let the hypersurface, 5, specified by 

^{xn = , (2.2) 



denote the surface of evolution of the initial wavefront; i.e. the initial wavefront is the spatial 
surface f{0, x) = 0. Assume that the field U is continuous across S; so only the normal derivative 
can be discontinuous. Choosing a local coordinate system denoted by x'^ = {(p, ip^), the "first order 
discontinuity" in a given quantity / can be defined as 
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dip 



(2.3) 



where 



[X] ^ X\ 



ip=0+ ~ ^\ip=0 



SoX 



Then it is easy to check that 



5o{d^,U) = [d^,U] = {d^ip)5iU 



(2.4) 



(2.5) 



Here we are considering the possibility that 5iU is discontinuous. Taking "first order discon- 
tinuity" is then like differentiation, 



dif{U) = {Vuf)SiU 



(2.6) 



The generalization to quasilinear systems of higher order, say q, in derivatives is straightforward 
now. Define 
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and consider the case that 



6qU ^0 , 6rU = , 0<r < 



(2.7) 



(2.8) 



Notice that "taking the discontinuity" depends on the order of derivative. For example, if / has a 
second order discontinuity, i.e. 62/ / 0, then 5if = 0, but 6i{d^f) / 0. Hence in general one has 



6r dfj, = {dfj.ip) 6r+l . 



(2.9) 



3 Analysis of the Characteristics 

Taking the discontinuity of (p.l|), we find that, on S, 

{A^ip^)6iU = 0. (3.1) 

where A^ = A^{U). [Here (p^ = d^cp; henceforth, we will drop the subscript 1 on 5 and use 6U 
to mean the first order discontinuity in U, i.e. 6iU.] Since 6U ^ 0, we see that S must be a 
characteristic surface, i.e. 

det {Af^iPf,) = (3.2) 

must hold on S. Thus 6U is in the kernel of A'^ip^, for a given choice of root in (|3.2[) . 

We can assume that ( |2.1D can always be rewritten such that A^ is the identity matrix and 
also that we have a flat metric on spacetime. We next define the unit normal to S, 

n = ^ , (3.3) 



and the "characteristic eigenvalue" 



X = -^. (3.4) 

|V<^| 



So for a given choice of root, SU is always a linear combination of the right eigenvectors 
of An = ii • ^ for the corresponding eigenvalue A. In the hyperbolic case, the set of eigenvalues, 
X^^'{I = 1, . . . , N), are distinct, and the corresponding right (left) eigenvectors R^ (L^) are real and 
form a linearly independent set. 

For general A^ {A^ not necessarily equal to the identity matrix), A are just the roots of the 
characteristic equation (|3.2D and we have 



-LiA'^X'-^^ + LiAn = = AnRj-X^-^^A'^Rj ; (3.5) 

LiA^X^^^Rj + LjAnRj = = LjAnRj - LiX^^^A^Rj . (3.6) 



Then for the hyperbohc case, LfA^Rj = 0, {I ^ J), and one can always choose to normahze such 
that 

LiA^Rj = 5^^ . (3.7) 

The characteristic equation (3.2) is homogeneous of order N inp^ = ip^. By analogy, we can 
write it as 

H{x,p) = Q, (3.8) 

where we may introduce the explicit coefficients 

H{x,p)=G^'^-^'^p^,...p^^ . (3.9) 



Then, by homogeneity of H, 






'dpf. 



which can be written as 



p-VpH = (3.11) 



where pis the D dimensional vector with components {po, . . . ,pm-i)) ^^^ Vpif is the vector with 
components {dH/dpo, . . . , dH/dp(^D_i^). 

Since p is the normal to the hypersurface S, we see that the tangential vector is parallel to 
VpH, or that the curves 

are tangential on (|3.8| ). Notice that this is a set of curves, one for each root of the characteristic 
equation. In analogy to classical mechanics, the "momenta" then satisfy 



on S. This follows from -^ = for a tangential deformation and from the compatibility condition 
df.iPu — dyp^ = 0. We can eliminate s for t = x^ , and then H factors as 



TV 



H=\{{p^-h'), (3.14) 

7=1 

and pq can be fixed as one of the roots. Reparametrizing, for a given root po = h '^{U,pi), we can 
span the characteristic surface with the trajectories obeying 

dxf" _ dh^o dpf, _ dh^o 

ds dp^ ' ds ^x^^ 

4 An Example 

Consider the following D = 2 example pi. Take the simple PDE 

dtu + ud^u = . (4.1) 

Then it easily follows that the characteristic curve is 

dx , , , , 

— = u{x,t). (4.2) 

Clearly ( |4.1| ) and ( [4. 21 ) imply -Jj = along the characteristic curve, i.e. the characteristic curve is 
a line with constant u. So the "velocity", -^, is constant and the characteristics are nothing but 
straight lines. If we denote by cp the point where the given line is at initial time t = 0, then by 
integrating (^), 

x{t) =u{ip,0)t + ip . (4.3) 

This implicit equation for ip = (p{x, t) is just the equation for the given characteristic curve, 
parametrized by its initial point. So then one has 

u{x,t) = u{(p{x,t),0) , (4.4) 



for the solution to ( |4.1| ), in terms of the initial value of li at t = 0. 

For a linear PDE, the coefficient of dxU in ( |4.1| ) is a constant independent of u and the 
characteristic curves are parallel straight lines. In the general case, when the coefficient of dxU in 
([4.1| ) is an arbitrary, say smooth, function of n, the slope of a given characteristic curve depends 
on the initial value of u at the starting point. Thus as time evolves, the characteristic curves can 
intersect and a shock may develop. It seems that this can be prevented only if the "velocity", ^, 
can be made independent of the coordinate normal to the characteristic. 

5 Exceptionality 

Even though we have neither found a proof in the literature nor been able to prove it rigorously, this 
"method of characteristics" seems to extend to first order PDEs in higher dimensions . Although 
there doesn't seem to be such a construction for the matrix system ( |2.lD , we want to carry on our 
discussion and see what can be done. 

Motivated by this 2-dimensional example, let us look for situations where the characteristic 
surfaces do not cross as they evolve, hence shock waves do not develop. Following the reasoning 
given above, we can demand that (locally) the characteristic eigenvalue, which after all is analogous 
to the "velocity" in the given example, is independent of y? in the evolution, or that 

Now let us look at the homogeneous case in ( p.l| ) (i.e. B = Q). Then taking a particular root 
Pq = /i*-^°) of the characteristic equation ( |3.14| ), defines a family of surfaces (remember p^ = d^ip^^'^') 

ip^^°\0, x) = constant . (5.2) 



Assume that U is just a function of (^. Then (p.lD imphes (by ;B = 0) that 



{A^V^) ^ = , (5.3) 



and hence 



f^ = x(^)(^)i?(^), (5.4) 

dip 

where W-"^' is the corresponding right eigenvector for the given root. U and i?'"^' being A^-vectors, 

we end up with N ordinary differential equations. We can always assume that w- ' has a nonzero 

component, and that particular component Uk can always be chosen such that it is equal to ip. 

Since the eigenvector W- ' is known as a function of U and c^, the ratios of the other components 

determine Ua = Ua{Uk) (^ 7^ K). The particular component Uk itself can be determined from 

the characteristic equation. Now a solution for U obtained in this way is called a simple wave |2|. 

This brings us to the so called exceptionality condition |^]. Let us first define what is meant 
by that: 

The wave corresponding to a given characteristic root is called exceptional if it is such that 

(Vf/A) • i? = . (5.5) 

Moreover when all the N wave modes are exceptional, the system is said to be CE 0. 
So for simple wave solutions of the homogeneous case just discussed, exceptionality condition 
is just the statement that Vf/A is orthogonal to the corresponding right eigenvector. In light of 



(5.1) and the discussion that led to it, exceptionality condition does after all seem to "justify" a 
generalization of the (naive) idea we developed to prevent the development of shocks using the 
D = 2 example (at least for the case of simple waves). 

Another way of looking at the problem may be provided by the following: 

From (^]^), it follows that 6U can be expressed as a linear combination of right eigenvectors 



as 



6U = TT^Ri (5.6) 

for some components vr-'^ (/ = 1, . . . , N) (also called as the coefficients of discontinuity). In general, 
one would expect these coefficients to evolve according to a nonlinear differential equation. In 
Appendix A, it is shown, how the CE condition can also be viewed as the statement that the 
coefficients of discontinuity evolve according to a linear ODE, thus the characteristic curves are 
prevented from intersecting locally. 

We now briefly mention another alternative approach developed in y] for a "covariant for- 
mulation" of exceptionality. Let us choose a particular root po = h^'^\U,pi) for some J in (3.14). 



We also have by ( |3.4| ) that po is proportional to A. If we now take the field-gradient, V[/, of the 
"Hamiltonian" H in (3.14) and set pQ = h^ ' afterwards, we see that only the term which is pro- 



portional to Vf/po ~ VjyA does not vanish in the resultant expression. For a simple wave then, 
contracting this with 6U and using (2^), we get 



VuX -dU = 5\ (5.7) 



for this particular root. 



Hence, in light of ( p.l]) and (5^), one arrives at the "equivalent" condition for exceptionality: 
The wave corresponding to a given root is exceptional, if on the characteristic surface H = 0, 
one has 

6X = . (5.8) 

Again for CE, this must hold for all roots, or that 

6H = . (5.9) 

In the following, we apply the above mentioned (two seemingly different) CE requirements 



in a variety of physical cases. In the process we supply the missing details leading to the results 
reported in [Q]. We want to make it clear that (^]^) was originally developed for systems in D = 2 
only 1^. Here, however, we apply ( |5.5| ) and (|5.9| ) to systems in higher dimensions. Although we 
are not aware of a rigorous construction that generalizes the results explained so far to PDEs in 
higher D, it is plausible that such a general proof can be given. 

After all, notice that the characteristic equation, and the condition for CE, are algebraic 
equations which must hold pointwise in any x^. At a fixed point on the characteristic surface at 
a fixed time, the normal fi is a fixed vector and proceeding for arbitrary n, and U, is the same 
as imposing the conditions pointwise. Furthermore, the original system is rotationally invariant, 
where rotations act on U as some linear matrix representation. So, the CE conditions are rotation 
invariant, and having chosen n (i.e. working at a fixed point and time) we can just rotate it to, say, 
the first coordinate direction x^ and proceed to study the eigensystem |^^ — AI| =0, provided the 
system can be brought into a form such that A^ equals the identity matrix. Of course U changes 
in rotating, but the eigensystem is worked out for arbitrary U. 

6 Scalar Field 

We now want to study in detail the CE requirement for a scalar field in D = 4. We first work out the 
problem using the requirement (^]^) , then show that one finds the same answer (with considerably 
less effort) using condition (|5.9p, as was in fact done earlier in O. 
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6.1 The First Way 

Given the covariant action / = Jd'^xL{z), where z = ^{df^a)"^ is the only invariant (in first 
derivatives), r] = (— , +, +, +), the field equations can be written as 

d^ ((a^f7)L') = (a^av) (d.a) (a^a) l" + (a^a^v) l' = o . (e.i) 

Here ' denotes differentiation with respect to z. 

By defining A = doa, B = dia, C = 92^7 and D = dsa (hence z = 1{-A^ + B^ + C^ + D^)), 
we can take U = {A, B,C,D) and write this system in canonical form as 

dt dx^ 

where each M* has elements (with i = 1,2,3; fi = 0, 1, 2, 3) 

1 _ -2ABL" 1 _ B'^L" + V 1 _ BCL" ^ _ BDL" 

"^00 — ' "^01 — Q ' "^02 — > "'■OS — ; 

2 _ -2ACL" 2 _ 1 2 _ C"^^" + L' 2 _ C'^^" 

"^00 — ' ""^01 — "^02 5 ""^02 — 5 "^03 — ; 

3 _ -2ADL" 3 _ 1 3 _ 2 3 _ ^^^" + ^' 

"^00 — 1 ""^01 — ""^03 ) "^02 — ""^03 1 "'■OS — H i 

and © = A^L" — V . Here we have also used the compatibility conditions ^ = ^^ , ^ = ^^ 
and ^ = 14- 

So, by the reasoning given at the end of the last section, we proceed to impose the CE 
condition ( ^.5[ ) using the eigensystem |M^ — AI| =0. ^ The characteristic polynomial of M^ turns 
■^In fact, we showed separately that taking arbitrary fi does not alter the final results obtained in this section. 
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out to be A^(A^ + oiA + 02) = where ai = ^— q — and 02 = q^ — . Apart from the eigenvalue 

at A = (with multiphcity 2), there are two distinct eigenvalues A3,A4 in the general case. Q The 
eigenvectors corresponding to each can be taken as 

ei=(0. ^2^„^^, ,l,0j , e3 = (-A3,l,0,0)^ , 

62= (0,-^2^^^,0,1) , 64 = (-A4, 1,0,0)^ 

which clearly form a full linearly independent set, hence our system is hyperbolic. We next apply 
the CE condition ( ^.51 ) to this eigensystem. Obviously, it will be trivially satisfied for A = 0. For 
the remaining nontrivial eigenvalues, note that by differentiating A^ + aiA + 02 = 0, we can write 

f)\ \9ai_ I 902. 

O-^ -^dUs ^ dUs 



dUs 2A + ai 

and the CE condition J2s du' Cp, s = becomes 

2dai da2 dai da2 

^ ^4+^(^4-5^^-5^ = ° fo^A = A3,A4 

by using the explicit form of the eigenvectors. However we know that A3, A4 satisfy A^+aiA+a2 = 0. 
Hence these two equations must be linearly dependent which implies that 

dai dai 802 ^ , dai da2 „ 
"^^4 + 55-^4=' ^"^ "^a4 + 5S=° 

have to be satisfied simultaneously. 

Substituting the explicit forms of ai and 02, we find after some calculation that 

"'M + lf-M = ^^^^^:^^#^W3BnAV + A»(B^-AV'l = 0, (6.2) 
"'JX + JS = !^^^^—^^^IB{M' + B')L' + BAHb'--A')L"] = 0. (6.3) 

"'For the degenerate case L'[L' — {A^ — B^)L"] = 0, A3 = A4 = —^ but then there is no nontrivial covariant action 



which can satisfy this. Moreover in this case, the system is no longer hyperbolic. 
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The only nontrivial covariant condition we can impose such that these two constraints are 
satisfied simultaneously is 

L'L'" - 3{Ly = . (6.4) 

6.2 The Second Way 

In this part, we want to impose (|5.9D using the formalism developed starting in Section ^ We now 
look for a surface S across which the discontinuity in a is second order. Thus with 52(J = Q, we 
have (cr^ = 5^cj , 99^ = d^(p) 

^(^fi = ^/iQ ■ 

Taking the discontinuity of (j^^) gives 

ip^{{6L')a>' + {6a>')L')=0 (6.5) 

which, with 6z = a^ {ScTfj) = cr^ ip^^Q and 6L' = L" 6z, becomes 

Q (gL' + {a''^,fL")=0 (6.6) 

where Q = '■p^'-Pfi- Comparing this to the previous discussion, we have H{x,p) = G^'^Pfj^pu = with 

(Pm = V^M'Q /O) 

G^''' = t^ L' + a^" a"" L" . (6.7) 



Imposing (|5.9|) (taking the discontinuity) gives 

Q (sg L" + (ctVa)' L'") {cj'^^^) = (6.8) 

and using Q V + {a^ip^fL" = in (U) yields 



-/ Tilt 



Jl Til 

QG (--^ + 3L") (aV) = (6.9) 
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This again leaves us with the condition (|6.4[ ). 

Notice that throughout, we have never used the fact that D = 4. This suggests that ( |6.4[ ) 
is a D-invariant {D > 2) condition. For general Z?, using the requirement ( |5.5| ), one ends up with 
M* which individually have A = (with multiplicities D — 2) and the remaining two nontrivial 
eigenvalues (corresponding to the pair of canonical variables {a, vr) for the only degree of freedom 
of the theory) with their corresponding eigenvectors yield (|6.4| ) when inserted into (|5.5| ). 

To find the solutions of ( |6.4D , we first note that by defining X = L', we can write it as 
X^ ( w ) =0 which will be satisfied nontrivially provided X' = or f ^ j = . Integrating 
these simple equations, we find X = ci ,L = c\z + C2 or (^ j = — 2c3 , -^i = —^c^z + C4 ,L = 
lb— "v/— 2C32; + C4 + C5 for Cp(q = 1, . . . , 5) arbitrary integration constants. Choosing these constants 
suitably, we note the particularly interesting cases as L = —z = — |((9^cr)^ and L = 1 — \/l + 2z = 



1 — Jl + {d^aY = 1 — a/— det[7?^jy + {d^a) [dyu)]^ the scalar analogs to Maxwell and Born-Infeld 
electrodynamics, respectively. 

7 Nonlinear Electrodynamics in D = 4 

We now come to, our most physically important example, the D = A Abelian gauge vector theories. 
Any gauge invariant action, depending on F^y = d^Ay — dyA^ but not its derivatives, has the form 

I[A^]=Jd''xL{a,P) , a^]^F^yF^^ , p^-^F^,*F^- , *F'^- ^ V-^-F.^ (7.I) 

Here subscripts on L mean differentiation with respect to the (only possible) invariants a or /3 and 
with our conventions e°^^^ = +1, ri^y = (— , +, +, +), a = B^ - E^, [3 = — B • E with E^ = F"* and 
B' = ^e^^^Fjk. 

We first drop the /3-dependence of L, show in detail how the CE condition (|5.9| ) is applied to 
-L(a), then reinclude /3 and carry out the CE condition ( |5.9| ) for full L{a,f3). [Again we originally 
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studied this problem using the requirement (5.5) which is quite laborious and tedious. We show in 



the Appendix B, the general outline of how ( |5.5[) is carried out for L(a). We don't show how ( ^.5[ 
is applied to the most general case, L{a,l3), although in this case we were able to prove, at least 
the sufficiency of ( [7.14|) and ( [7. 151) using (^).] 



We look for a hypersurface 5 across which the discontinuity in A^ is second order. Hence, 
with (^2^^ = ''T/i, we have 

SF^"" = ipf" it" - ^^ tt'' , 5 *F^'^ = e'"'''^ ip„ iTr (7.2) 

and 

6a = 2 F'''' ip^ vr, ,613= *F'"' ip^ n, . (7.3) 

7.1 L{a) Case 

For L = L{a) only, the field equation is simply d^iF'^'^L') = with the Bianchi identity d,^ *F^^^ = 0. 
[Here, ' denotes differentiation with respect to a, of course.] 
Taking the discontinuity of the field equation, we find 

-2U^'J'L" + i^p^ tt") ip^' L' - g ^T^' L' = (7.4) 

where we have used U^^ = F^^ ipx ,G = y^^y^^j. and JT = F'^'^ ipx iTf^ = If^ t:^- [Taking the discontinu- 
ity of the Bianchi identity, one can see that it follows automatically.] 



Now contracting (7.4) by —U^ (and assuming JT / for the general case), we find 

H = 2uL" + gL' = Q (7.5) 

where we have defined u = U^U^. Now 6u = 2U^ 6U^ = 2Q T and 6a = 2T. Hence imposing 
dU) gives 

<5// = J^(4uL'" + 6gL") = (7.6) 

15 



and substituting for u using (|7.5|), we end up with 



6H = 2J^g Ul" -^L'"^ =0 (7.7) 

Hence we again find ( |6.4[) in a new disguise, whose solutions we can immediately copy as L{a) = 
k + {d + ca)2 (for arbitrary constants k, d, c) apart from Maxwell, L = — ^a (or L' = constant). 
We remark that in D = 3, where a is the only invariant, this is also the CE result, there one 



also has Vl + « = \/— det[r/^,y + -^^i/]- In D = 2 there is of course no propagation for any L{a) 
and correspondingly no restrictions are imposed. 

7.2 L{a,P) Case 

We now want to study the full Lagrangian L(a, /?). For this case, the field equation is di, {LaF^'^ + 
^Lp *F'^'^) = 0. Taking the discontinuity, we find 

F^^- ^, {5L^) + ^, {6Fn L, + ^ *F^- ^, {5Lp) = . (7.8) 

Using ( [7.2[ ) and ( |7.3|) , this becomes 

- U^' (2^L,„ + xLa/j) + 95. i^^T^" - ^''7T^')La -\v^ {2TL^(i + xLf3^) = (7.9) 

where we have used V^^ = *F^'^ if\ and x = *F^'' (f\ it a = V" -Ka- 

Now contracting (|7.9D by — f/^, and then by — V^, we get respectively: 

J'{2uLo,a.+gL^ + [igLo,p)+x[uLo,p + ]^(3QLp^ = 0, (7.10) 
r {2pgL^^ + L^p{u-ag))+x(pQL^p + gL^ + ]^Lpp{u-ag)\ = 0, (7.11) 

where we have made use of the identities U^Vfj, = (3g and V^V^ = u — ag. 

For this system to have nontrivial J- and Xi the determinant of the 2x2 matrix, that comes 
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from writing ([7.10|) and ([7. Ill) as {J- x) M = 0, must vanish. Hence we have yK = LaaL/^fs — L'^g) 



H = u^K + uG 



2La [ Laa + J^PP ] " <^K 



+ Q' 



L„ (l^ + 2[3L^p - ^aLpfA - P^K 



Notice that for the discriminant, one gets 



g2=l i-La (4 L^a - Lpp) + 2aKf + A [-L„ L^p + pxf 



For the case A = 0, i.e. when 



= 0. 

(7.12) 



(7.13) 



La (4^ Laa — Lgg) + 2a [Laa Lpp — Lap] — , 
— La Lap + /? [Laa Lpp — L^p] = , 



(7.14) 
(7.15) 

H takes the form H = K {u — h)"^ = and for i^ 7^ 0, it follows that ( |5.9| ) is satisfied automatically. 
Hence any L that fulfills ([7l4D and (fTlSl ) is CE. 

The differential constraints ( [7.14 ) and ( [7.15| ) were actually found a long time ago in different 
contexts @, ^, |3|. Bialynicki-Birula ||5| discovered these equations by studying the propagation of 
weak electromagnetic waves on a strong, constant field background. He showed that they were 
necessary for both polarizations of light to propagate according to the same dispersion law; he calls 
these as the "no birefringence" conditions. Plebaiiski |Q], studied the theory of small perturbations 
and their discontinuities in nonlinear electrodynamics and considering all possible cases for the 
form of the background field (e.g. null, algebraically general) and constraining the system with 
physical conditions such as causality along the way, proved the necessity and sufficiency of these 
differential constraints for the excitations of light to propagate according to a single characteristic 
equation, with coinciding characteristic surfaces. Boillat ||3| found these conditions using equation 
(|5.9| ) explained in this paper, and demanding that it be expressible as a complete square as explained 

in O) - ( fTTsI) . 
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The work of Plebanski involves an extensive study of characteristic surfaces, which is what 



CE formulation is all about, in nonlinear electrodynamics, so it is not surprising that he finds (7.14) 
and ( |7.15| ) as the conditions to have coinciding characteristic surfaces; after all that is also what 
Boillat gets using the CE viewpoint. Bialynicki-Birula effectively allows the discontinuities in terms 
of weak disturbances about a generic background. It is not surprising to see that having the same 
dispersion law for both polarizations implies having a single characteristic surface for the evolution 
of discontinuities. Apart from these historical details, we will call the two conditions, ( [7. 141 ) and 
(7.15|), the "strong CE" conditions from now on, because of this extra physical constraint that they 



impose on the system. 

The solutions of ( [7.14 ) and ( 7.15| ) are important to define physically acceptable models of 



electrodynamics. It is clear that the Maxwell action, iMax = —^ J d'^xa, is indeed a solution, and 
it was realized in [^, ^, ^ that another is the (once again popular) Born-Infeld action [^], 

Ibi = J d^x (1 - y^-det[r?^, + F^,]) = J d^x (1 - ^Jl + a-p^) . (7.16) 

However, these are not the only solutions, unless one further requires that they reduce to I Max for 
weak fields. Otherwise there are additional solutions such as L = a//3. [As shown in |l|, without 
requiring the weak field condition, imposing strong CE with duality invariance (a property shared 
by both of these theories), singles out Maxwell and Born-Infeld.] 

Now we continue with the general case K ^ 0, A ^ 0. For convenience, we define 

P = 2La Uaa + ^LpfA -aK , R = Lo,\L^ + 2(3L^p--aLpp\-0^K , 

p = 2 Laa , q = La+ (3 1^/3 , r = La/3 , s = - (3 Lpp 

and rewrite H as 

H = u^K + ugP + g^R = (7.17) 
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Now imposing ( |5.9| ), we find 

6H = u^{2TK^ + xKf3) + uGiiJ'K + 2^P„ + xPfs) + g\2J^P + 2^i?« + xRfi) = (7.18) 

where we have used 6u = 2QJ^ , 5a = 2F and (5/3 = x- 

Now using V? = - ^{uP + GR) (from ([tItD ) and x = - f^rif (f^°™ (|7lc|)), we find that 



(7.1§|) is simplified into a form (5/7 = uQQi + Q C2 = 0. Since K ^ {) and A 7^ 0, this imphes that 



Ci and C2 rnust vanish simultaneously. [This can also be seen as the requirement that ( [7.17 ) and 



( 7.1S| ) be linearly independent.] Finally one finds that, CE requirements (corresponding to (^^)) 



are 



2Ka{rP^ - sPK - rRK) + Kp{qPK - pP^ + pRK) + 2P^{sK'^ - rPK) 

+KPf3{pP-qK)+2Ra(rK^)-Rp{pK^)-2rPK^ + AsK^ = (7.19) 
2KairPR - sRK) + Kp{qRK - pRP) - 2P^{rRK) + Pp{pKR) 

+2Ra{sK^) - Ri3{qK^) - 4.rRK^ + 2sPK^ = (7.20) 

In Appendix C, we give these equations in terms of L and its derivatives only. Notice that 
these equations are quasilinear (linear in the third order derivatives of L) just like ( |6.4D . Being 
of third order, they are of course weaker than ( [7.14 ) and ( [7.15 ). Born-Infeld, of course, satisfies 



these equations but we have neither been able to solve them in the general case, nor for the more 
restricted situation when one also demands duality invariance. For the latter, one would expect to 
get two (or, with a bit of luck, only one) ordinary differential equations involving only a-derivatives 
when one substitutes for /3-derivatives by using the duality invariance constraint (see |Q]) and its 
(q, /3) derivatives, recursively. 

An application of CE, rather than strong CE, comes from theories involving the (neu- 
tral) scalar plus the Abelian vector field, where possible invariants are {a,l3,z (= ^(9^o")^),y = 
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^{F/iu(^'^)'^)- For a Lagrangian L{a,P,z), the CE conditions further require L^a = = Lzf3, 
which reduce it to the noninteracting L[a,j3) + L{z) form. Having the "fuhy Born-Infeld" form 



det[rjni, + F^u + afj^a,y] in mind, one can consider L{a, /3, y, z). It turns out, however, that there 
are no CE actions with nontrivial dependence on the other possible variable y = ^{Ff^u'^'^)'^ ■ Thus, 
CE alone separates the two systems and imposes the previously stated constraints on their forms. 

8 Gravitational Models 

Finally, we turn to gravitation. For Einstein's gravity in vacuum, as well as the linearized theory, 
the gravitational waves are CE, the characteristic surfaces describing discontinuities being null (see 
e.g. HI). It can be shown that this result holds for any D > A. [For D = 3, there is of course no 
propagation and no restrictions are imposed.] One can further look at pure gravitational actions of 
the form / d'^x{pR'^j^ — qB?■)^J—g in D = 4 and / d^xf{R)y/^ in Z? > 3 and show that the same 
conclusion remains unchanged. 

To reduce these theories to a first order system would be inconvenient, but is fortunately 
made unnecessary by a simple extension of the previous discussion. Clearly, if we rebuilt the 



original higher order equations from the set (2T), we would simply have the situation that all the 



derivatives of the field are assumed continuous except the highest one. 

We first sketch the Einstein case to establish notation. Considering a second order disconti- 
nuity in the metric across some characteristic surface </? = 0; ^29 ^u = t^i^v, we have {ip^ = d^ip) 

1 



2 



{ff,fXTT V + H^v^XT^ ii - H^yiHivT^ X " ^X^ T^, 



liuj 
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and 

which imphes for 

SqG^^ = 5o{R^u — -Qfj^yR) = 5oR^^ — -g^u^oR = 



1 r 



^oGfii^ — 2 



2"^" ' " '^^ 2^ 



ffifXT^ v + <Pu<f\T^ ti-V^fiiPuT^ \-V^Xf T^tiu- g,j.u{'P''f'^TraT -V^aV"'^^ t) 



. 

f8.ll 



In the harmonic gauge g'^^'^T" ^j,y = 0, one finds that its first discontinuity imphes 

271^^^^ - ^'^ mV'" = (8.2) 

Multiplying this by gua^r + gur^a, one gets 

^fi'fXT^^ V + ^v'^\T^^ iL - •^ii.'^vT^^ A = , (8.3) 

whereas contracting by (py^ one finds 

'/'^^"vr^. = ^Pi.^'^T^'' u . (8.4) 

Using (U) and (O) in (|]l|), one ends up with 



^^G^u = -{ifxip^TT'"' + -g^yifx^K'' ,) = 



(£±2),„ „A„. 
^" 4 



Hence taking the trace 

6oG^' ^ = '^ ; "V aV^-^tt" , = 

The discontinuity in (7^jy is arbitrary, hence vr'^ ^ 7^ 0, which imphes that ifx^p^ = 0. This tells 
that the characteristic surfaces are null: the discontinuities travel with the speed of light in all 
directions. The same holds for the linearized version of the theory as well of course. 
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For generic quadratic Lagrangians {pR^u R^'^ ~qR'^)\/~9 in D = 4, using similar steps (writing 
the field equations, choosing harmonic gauge as before and utilizing the identities (^.S]), (^.41)) one 
finds that {Q = ip^(px , vr = vr'** a) 

/I p \ p \ 

Qilj^ip- 2q)ip^ip^TT - -Qtt^u - i^giiviij^ - 2g)<37r 1 = . (8.5) 

Taking the trace, one gets Q^vr(p — 3g) = 0. (The choice p = 2>q corresponds to Weyl-tensor 



squared; the scalar degree of freedom is absent.) For p = 3q, (|8.5D becomes 



1 3 1 

Since Ti^y is arbitrary, we see that again Q = 0, as in Einstein, so Q = characterizes both Einstein 
and the quadratic action. 

Finally, we consider the class of actions ^ d xf{R)^J—g in D > 4, whose field equations are 

^M. = R^.uf - \g^..f + (^/..V^V^ - V^V,)/' = . 

Hence the order of highest derivatives is four. Following similar steps by taking d^g^y = tt^i,, we 
find the same expressions for 63T ^^ and 52Rfiu as for ^iF ^^ and 5qR^u in the Einstein case. 
Using these, we get 

SqE^^ = {Qg^u - ^^lV>u)i^''v>'^^^aT - Qvr)/" = . 



Going to harmonic gauge with identity ( ^.4| ) and taking the trace, one gets 

6oE^ , = ^l^Q\f" = . 

Here too Q = is the only solution, and so for a wide class of gravitational actions the propagation 
obeys the Einstein behavior as well. As is well known, these systems are variants of Brans-Dicke 
scalar-tensor theories so their "good propagation" is not surprising. 
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Appendix A 

In this Appendix, it is shown how the CE requirement (|5.5| ) causes the coefficients of discon- 
tinuity to evolve according to a linear ODE. 

Consider the wavefront at the boundary of a region with smooth enough solution U . The 
following derivation fills a gap in Q and generalizes [^ where the evolution of discontinuities in 
first derivatives of the dependent variables is studied. Choose a root of the characteristic equation, 
Pq = h^^{U,pi). Differentiating (2.1) with respect to 99, and contracting with the corresponding left 



eigenvector, we have {A, B,C = 1, . . . , N) 

(d^Uc) L'a {yuc^'XB) {B^Ub) + L'aA% (d^d^UB) + (d^Uc) L^ (Vc/c^a) = . (A.l) 

Now we can take the discontinuity of this equation. We have higher derivative terms, but notice 
for the term in the middle that (93^ = d^ip) 

d^d^UB = M^^Ub) + {d^^^){d^UB) + (d^^iPW^.d^UB) + (d^d^i^Wi^^UB) ■ (A.2) 



The first term on the right hand side of ( |A.2| ) vanishes when contracted with A^ against the left 
eigenvector. Thus, there is just one ip derivative (i.e. no 5^ pieces), and the discontinuity can be 
taken as before. We first compute the following to use for the first term in ( [A.lJ ) 



[{d^Uc) {O^Ub)] = {6Uc) ^^ (SUb) + {5Uc) {O^Ub) + {d^Uc) V^m (^Ub) . (A.3) 
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Now using (^), ( |A.3| ) and taking the discontinuity of ( |A.1| ), we find 

La ^\b (df^tpW^^^UB) + mV^ + Vf. i^Uc) Li {VucA%) {6Ub) = . (A.4) 

[Here the first term comes from the third term in ( [A.2| ), the last term comes from the first term in 



( A. 3 ) and we have cohected as "m", the coefficients of terms hnear in n without derivatives, ^^m^j" 



are determined by the background solution, as well as the extrinsic geometry of the characteristic 



surface.] Let's examine the other terms in ( A.4 ) 



The first term in ( |A.4| ) is (up to a redefinition of the coefficient matrix m) 

{L\A%Ri)[d^i,'){d^.T:j) = {L'j,A%R'B){d^TTj) . (A.5) 

By taking the pi derivative (i.e. applying 3p-) of the straightforward equation L^A^j^^ ^sPtJ- = 0; 
and using ( |3.7D , one finds 

LiA\sRi = -^''^ . (A.6) 



Hence using the equations for the trajectories ( p. 15 ), the first term in ( [A.4| ) reduces to ■^, where 



d d dx' d 

ds ~ M ds dx^ ■ 

For the last term in ( [A.4| ), we have (by making use of L^Aj^^ Ll^BP^J, = 0) 

^,i6Uc)L'A{VuaA%)i6UB) = -{5Uc)L'aA%{5Ub){VucV^.) 

Notice that the last factor has U dependence via the characteristic root po- Hence using ( p.4| ) and 
(P) (with dH)) 

^^ (SUc) L'a {Vuc^^) (^Ub) = (SUc) 5" 7TJ \Vip\ (Vf/eA) 

= \Vip\TT'7TjRl,{Vuc^) (A.7) 

Finally then, we have a nonlinear equation for the evolution of the coefficients of discontinuity 
along rays, 

— + m^jn-^ + |V(^| TT^ vrj Ri (Vt/^A) = . (A.8) 
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This is computable because all "t/'s" above are actually "C/'s". 

Thus, we recognize that CE condition can also be viewed as the statement that the coefficients 
of discontinuity evolve according to a linear ODE. 

Appendix B 



In this Appendix, we show the general outline of how ( |5.5D is carried out for models of 
electrodynamics that depend only on the Maxwell invariant, i.e. L = L{a). 

By taking U = (E, B) and looking only at the spatial components of the field equation 
dy{F^^ L') = and the Bianchi identity di, *F^'^ = (i.e. setting fj, = i), we can write this system 
in the form H^ 4^ = where H'^ are 6x6 matrices. For this new system (as in the scalar field case 
when we had 2(= 4 — 2-1) nontrivial eigenvalues corresponding to the pair of canonical variables 
for the only degree of freedom of the theory) we expect to get A = eigenvalue with multiplicity 
2(= 6 — 2-2) for each individual H* because of the 2 degrees of freedom. 

Just as was done in the scalar field case, we only take H^ to start with. Hence we have 



H° ^ + Hi ^ = where 



H" 



P Q 



V 



I 

which have elements (with i,j = 1,2,3) 



and H^ 



S R 



a 



Pij 



i%j 



^ij 



zEiEjLi — uij-L , 



-2EiBjL , 



"^^likEjBkL , 

— eii/c i'^BjBkL" + 5jkL') , 



cr, 



ij 



-^lik , 
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and I is the 3x3 identity matrix. 
Multiplying by 



(H 



0\-l 



/ p-l _p-lQ » 



we bring this system into the canonical form I 4P + W ^^■ = where 



0\-1tt1 



W = (H'^)"^H 



' p-i(S-Qc7) P-IR > 



V 



a 







/ 



Then the characteristic polynomial of W turns out to be, just as predicted, of the form 

A^(A^ + csA^ + C2A^ + ciA + cq) = 0. The eigenvectors corresponding to each A^ can be taken to be 

/ \ 
a 



vV 



where 



Ai = : ai = , bi 



^^ 



y2 



\y3 J 



with 



y2 
yysj 



r22r33 - r 23^32 



V 



^33 -r23 
-r32 r22 



\ ( \ 

^21 



A2 = : a2 



^^ 



w 



^^ 



Z2 
\Z3 J 



with 



Z2 
yZ3J 



^22^33 - -^23^32 



^^33 -^23 
-^^32 ^^22 



and for A^ / (s = 3, 4, 5, 6) 



\ 



/ 



As(/522 - As723) 

\^ As(p23 + As(-As +722)) J 
where jij = [P^^ (S - Qa)]ij and pij = [P^^ Rjjj. 



, bs = —aa.s 



\ ( \ 

S21 

y.3iy 



\ 



-(/>23 + As(-As +722)) 
i^ {P22 - As723) j 
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Clearly these eigenvectors form a linearly independent set. By differentiating A^ + c^X^ + 



C2A^ + ciA + Co = 0, we get 



dXr 



(Ap 



1 3 dc3 



, (\ \2 dc2 I \ dci I dcQ 



dU. 



4(Ap)3 + 3c3(Ap)2 + 2c2Ap + ci 



OAj, 



Substituting this into the CE condition (5^) X^s ajf" ^p, s =0 gives a polynomial of order 6 in A, 
but by using A^ + csA^ + C2A2 + ciA + cq = repeatedly, one can reduce this to a polynomial of 
order 3, whose coefficients must be set equal to zero simultaneously. 

Doing so, we find that the only nontrivial covariant condition, we can impose such that these 
coefficients vanish simultaneously, is 



L'L'" - 3(L 



ll\2 



. 



(B.l) 



Appendix C 



Here, for completeness, we present ( [7.19 ) and ( 7.20| ) in terms of L and its derivatives only. 



They become {K = L^aLpp - L\ 



-^LaLa/BjS yLa{l6L^^La0 + SLaaL^p + L^^Lpp) 



-K 






-K 



8aLl. + pLpp{12Ll. + L 



73 T ^(3/3) 






-K 



SaLaaLajS + (^{^L^p + SLaaLpfS + -^/3«) 



LaLppji fLQ,(16L^^ + 12L^„L^^ + L^^ - ALl^^Lpp) 



-K 



8aLl^ + pL^piULl^ + L 



a/3 J 



--{4L^^ + Lpp)K^[Lo^Lo^p-PK] = (D.l) 
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and 



+PLaLai3[16LaaLf^p + GL^pLf^p — 2LaaLpp 

—(iK —aLapLpp + 2f3{ALaaL^p + 2L^pLpf^ + LaaLp^ 



+ -^LaLai3f3 [i^L^^ + L^p){2L^Lap — aLaLapLpp) 



-PKLai3 —aLajsLpis + 2/3(4L„^ + L^p + 2LaaLpi3) 



+2(3Lot{SL^^L^p + 21/ Q,^ + LaaLppL^p - L^^L 



pp, 



\ ^J-ia^aaa 



^^al3 + ^Pf3)i'^^a^aP — OlLaLapLpp) 



+(3La{\-QL^p + ^L^pLpp — 2LaaLpp) 



-LaLi3j3i3 ( 2Lq,Lq,q,(4Lq,q, + 2Lq,^ — LaaLjsp) 



respectively. 
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